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[0001] This invention was made with Govemmeat support under Program Project 
Grant P01CA76259, P0ICA81534, and P30CA56036 from the National Cancer Institute. The 
Government has certain rights in this invention. 

CROSS REFERENCE TO RELATED APPLICATIONS 

[0002] This jq^lication claims the benefit of U.S. Provisional Application No. 
60/417,842, filed October 1 1, 2002, incorporated herein by reference in its entirety. 

FIELD OF THE INVENTION 

[0003] The invention relates to the identification and cloning of ARTS 1, a novel tumor 
suppressor gene, and to methods of making and using the same. ARTSl was originally 
designated ARLTSl, and may also be referred to as ARLSl. 

BACKGROUND OF THE INVENTION 

[0004] Loss-of-function mutations in tumor suppressor genes (TSGs) play an essential 
role in the initiation and progression of human tumors, while inactivation by me&ylation seems 
to be important for tumor progression (Weinberg, R.A. Tumor siq)pressor genes. Science 254, 
1 138-46. (1991), which is incorporated herein by reference). Chromosome 13 at band ql4, 
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where the retinoblastoma (RBI) gene (Marshall, C J. Tumor suppressor genes. Cell 64, 313-26. 
(1991), which is incorporated herein by reference) is located, is hemi2ygously or homozygously 
deleted in a diversity of hematopoietic and solid tumors (Bulhich, F. & Croce, CM. Molecidax 
biology of chronic lymphocytic leukemia. In Chronic Lymphoid Leukemias (ed, Chenson, BID.) 
9-32 (Marcel Dekker, Inc., New York Bassel, 2001), which is incorporated herein by reference). 
Several reports presented evidrace for a new tumor suppressor locus telomeric to the RBI gme 
(Brown, A.G,, Ross, F.M., Dume, E.M., Steel, CM. & Weir-Thompson, B.M. Evidence for a 
new tumour suppressor locus (DBM) in human B-cell neoplasia telomeric to the retinoblastoma 
gene. Nat Genet 3, 67-72. (1993), Howthom, L.A., Chapman, R., Oscier, D. & Cowell, J.K. 
The consistent 13ql4 translocation breakpoint seen in chronic B-cell leukaemia (BCLL) involves 
deletion of the D13S25 locus which hes distal to the retinoblastoma predisposition gene. 
Oncogene 8., 1415-9 (1993) and Liu, Y. et al. Chronic lymphocytic leukemia cells with allehc 
deletion at 13ql4 commonly have one intact RBI gene: evidence for a role of an adjacent locus. 
Proc Natl Acad Sci U S A 90, 8697-701. (1993) which are each incorporated herein by 
reference). However, none of the genes in the region were foimd to be inactivated by either 
combination of deletion, mutations or promoter hypermethylation. 

[0005] There is a need to identify and clone TSGs whose loss of function are associated 
with initiation and progression of human tumors. There is a need to identify a TSG telomeric to 
the RBI gene. There is a need to identify nucleic acids which can serve as probes or primers for 
the detection of the TSG. There is a need for genetic based therapeutics which can be deUvered 
to function in cells with a TSG mutation. There is a need for isolated protein and for antibodies 
which specifically react to the protein. There is a need for assays, reagents and kits to identify 
compounds that can upregulate, enhance or compensate for inactivity of the TSG. There is a 
need to study and understand the mechanisms by which the TSG is involved in initiation and 
progression of tumors and for reagents useful in such studies. There is a need to identify new 
cancer therapeutics and for kits and methods of identifying such compounds. 

SUMMARY OF THE INVENTION 

[0006] The invention relates to isolated proteins comprising the amino acid sequence 
shown in SEQ ID NO:2. 
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[0007] The invention relates to isolated nucleic acid molecules that comprise nucleic 
t acid sequences that encode a protein that has an amino acid sequence shown ia SEQ ID NO:2. 

[0008] The invention relates to isolated nucleic acid molecules that comprise SEQ ID 
* NO:l or a firagment thereof having at least 10 nucleotides, 

[0009] The invention relates to a recombinant expression vector comprising the nucleic 
acid molecule comprising SEQ ID NO:l. 

[0010] The invention relates to a host cell comprising a recombinant expression vector 
comprising the nucleic acid molecule that comprises SEQ ED NO:l. 

[0011] The invention relates to an oligonucleotide molecule comprising a nucleotide 
sequence complimentary to a nucleotide sequence of at least 5 nucleotides of SEQ ID NO: 1 . 

[0012] The invention relates to isolated antibodies that biad to an epitope on SEQ ID 

NO:2. 

[0013] The invention relates to methods of identif/ing modulators of Caspase-1 
protease activity. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0014] Figure 1. Localization of ARTSl tumor suppressor gene at 13ql4, A. The 
position of genetic markers and positions of genes on the map is shown. B. A multiple 
alignment of human ARTSl (SEQ ED NO:2) with human ARL proteins. Several motifs 
presumably involved in nucleotide binding and hydrolysis (PMl, PM3, G2 and G3), 
characteristic of Ras-related GTPases27, are also present in ARTSL Furthermore, five 
additional aminoacids typical of the ARF subfamily (G2, N47,W74, R95 and G161) are aU 
conserved in ARTSl. In the C-terminus ARTSl harbors less arginine or lysine residues than 
ARL4, ARL6 and ARL7. The location of Tip 149Stop mutation is indicated by an arrow. 

[0015] Figure 2. ARTSl mRNA expression and methylation analysis. A. Expression 
of ARTSl by Northern blotting in cancer cell lines show absent or reduced expression in several 
cell lines. B. ARLTSGl expression correlates with the level of methylation of this locus 
analyzed by Southern blotting of digested genomic DNA with Bgin alone or in combination with 
HpaH. The combination BglH+MspI was used to determine the firagment length without respect 
of methylation. The presence or absence of ARTS 1 expression is shown by or 
respectively and the restriction map (Bglll-B-thick vertical lines, Hpall-thin vertical lines) is 
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drawn at the bottom. The position oftheORF probe used is indicated by *. C Correlation 
betwem ARTSl expression analyzed by RT-PCR and CpG sites mefliylation analyzed by 
bisulfite sequencing in fi-esh tumors; white and black rectangles represent unmethylated and 
hypermethylated CpGs respectively, while gray rectangles represent partially methylated CpG 
sites. As control we use Epstein-Barr Virus transformed lymphoblastoid cell lines. 

[0016] Figure 3. ARTSl suppresses tumorigenicity and A549 cells. A. Restoration of 
ARTSl expression by transfection of the minig^e into A549. B, Tumor formation in nude 
mice. The weight (mg) of tumors for the five analyzed clones determined at the indicated times 
are shown. The same results were obtained by measurement of tumors. C. Example of 
tumorigenesis in nude mice at 8 weeks after s.c. injection of 10 6 cells. C. Colony growth in 
soft agar (data at 21 days after plating 5x10 4 cells). 

[0017] Figure 4. Analysis of ARTSl expression in human tissues by Northern blotting 
reveals that ARTSl is ubiquitously expressed. 

[0018] Figure 5. Mutation analysis in ARTSl shows the presence of the germline 
polymorphism G446A (Trpl49Stop). The presented sequences are in reverse orientation. For 
identification of the G446A (Trpl49Stop) mutation a rapid assay was developed using the Mael 
site mtroduced by the mutation. DNA was amplified using primers Mael-Fl (which contains a 
changed base firom the wUd-type sequence to destroy a constitutive Mael site) and Mael-Rl (for 
sequences of the primers, see Table 4), purified using QIAquick PGR purification kit (QIAGEN) 
and digested with 2U of Mael (Boehringer Manhiem, Germany). The amplification of a normal 
allele gives rise to a single 138bp product, while the mutant allele produces two bands of 106 and 
32bp. Note that the digestion has low efficiency and only partial digestion products were 
obtained. Digested PGR products were loaded on 3% agarose gel and visualized using a UV 
imager. N=normal and T=tumor, 

[0019] Figure 6. Both wild-type ARTSl and the truncated ACARLTS 1 proteins are 
localized in cytoplasm and nucleus. Subcellular localization using ARTSl-GFP fiision protein 
293 cells were transfected with pARLTSl-gQ), pAC-ARLTSl-g]^ and control plasmid, 
pEGFPNl. Bright field (right) and fluorescence images (left) of the same microscopy field are 
presented. 

[0020] Figure 7. The sequence of the cDNA of human ARTSl (SEQ ID NO: 1) is 
shown. The GenBank Accession number for the sequence is AF441378. 



WO 2004/033659 



-4- 



wo 2004/033659 PCT/US2003/032270 



DETAILED DESCRIFnON OF ILLUSTRATIVE EMBODIMENTS 

[0021] The invention arises from the identification of ARTS 1, a novel member of the 
ADP-ribosylation factor family. ARTSl is located at 13ql4, and displays features characteristic 
of a TSG. ARTSl is downregulated by hypermethylation in 25 out of 75 (33%) hnman primary 
tumors and cell lines analyzed. Furthermore, analysis of 800 tumor and normal DNAs revealed 
the presence of several variants including a gennline nonsense polymorphism G446A 
(Trpl49Stop) that is three times more frequent in cancer patients with a family history of cancer 
than in the normal population. Restoration of wild-type ARTSl expression in A549 cells, which 
shows low levels of expression, suppresses tumor formation of A549 cancer cells. 

[0022] The GenBank accession number of the human ARTSl cDNA is AF441378. 
During the final stages of the fimctional studies described below, a clone of 1.6kb, BC013150, 
containing the ORF of ARTSl and encoding the hypothetical protein FLJ22595 (accession 
number AAH13150) was deposited in the GenBank. 

[0023] The ARTSl gene, and proteins, polypeptides, or peptides encoded by the gene, 
can be used in methods of preventing abnormal cell growth in mammalian subjects. Such 
methods involve administering to a mammal a composition comprising an effective amount of 
the ARTSl protein. Such methods also involve administering to a mammal a composition 
comprising an ex5)ression vector comprising a gene encoding ARTSl. 

[0024] The discovery of ARTSl provides the means to study its fimction as a TSG, to 
design probes and primers to detect its presence and/or to detect mutants, to prepare isolated 
nucleic acid molecules, to insert the nucleic acid molecules that encode ARTSl into vectors such 
as cloning vectors to produce multiple copies, expression vectors useful to transform cells that 
will produce the protein and gene therapy vectors which can be used treat patients with tumors 
arising from a lack of endogenous ARTSl function. Antisense compoimds may be produced to 
generate tumor cells that lack ARTSl fimction that can be used in assays to identify compounds 
usefiil to treat such cancers. Assays and kits can also be provided to identify compounds that 
upregulate or enhance ARTS 1 activity. Transformed host cells may be used in methods to 
produce ARTSl protein. Antibodies can be prepared that specifically bind to ARTSl protein 
and used to isolate or detect the protein including to distinguish wild type from mutants. 

[002S] In certain embodiments, the present invention provides isolated ARTS 1 protein 
that comprises the amino acid sequence shown in SBQ ID NO:2. The ARTSl protein can be 
isolated from natural sources, produced by recombinant DNA methods or synthesized by 
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Standard protein synthesis techniques. In other embodiments, the invention relates to ARTSl- 
like polypeptides, which are polypeptides that are similar to, but differ from, the ARTSl 
polypeptide by having at least one amino acid substitution or deletion. For example, 
consCTvative amino acid substitutions may be made at one or more nonessential amino acid 
residues of the ARTSl protein to generate ARTSl-like polypeptides. A "nonessential" amino 
acid residue is a residue that can be altered from the wild-type sequence of ARTSl protein (e.g., 
the sequence of SEQ ID NO:2) without altering the biological activity, whereas an "essential" 
amino acid residue is required for biological activity. A "conservative amino acid substitution" is 
one in which the amino acid residue is replaced with an amino acid residue having a similar side 
chain. Families of amino acid residues having similar side chains have been defined in the art. 
These famiUes include amino acids with basic side chains (e.g., lysine, arginine, histidiue), acidic 
side chains (e.g., aspartic acid, glutamic acid), uncharged polar side chains (e.g., glycine, 
asparagine, glutamine, serine, threonine, tyrosine, cysteine), nonpolar side chains (e.g., alanine, 
valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan), beta-branched side 
chains (e.g., threonine, valine, isoleucine) and aromatic side chains (e.g., tyrosine, phenylalanine, 
tryptophan, histidine). Such substitutions would not be made for conserved amino acid residues, 
or for amino acid residues residing within a conserved motif. 

[0026] Antibodies which specifically bind to the ARTSl protein may be used to purify 
the protein from natural sources using well known techniques and readily available starting 
materials. Sxich antibodies may also be used to purify ARTSl from material present when 
producing the protein by recombinant DNA methodology. As used herein, the term "antibody" 
is meant to refer to complete, intact antibodies, and fragments including Fab fragments and 
F(ab)2 fragments. Complete, intact antibodies include monoclonal antibodies such as murine 
monoclonal antibodies, chimeric antibodies, primatized antibodes and humanized antibodies. 
Antibodies that bind to an epitope present on ARTSl are useftil to isolate and purify ARTSl 
from both natural soiirces or recombinant expression systems using well known techniques such 
as affinity chromatography. Such antibodies are usefiil to detect the presence of such protein in a 
sample and to determine if cells are e^ressing the protein. 

[0027] The production of antibodies and the protein stmctures of complete, intact 
antibodies, and firagments such as Fab fragments and F(ab)2 fragments and the organization of 
the genetic sequences that encode such molecules are well known and are described, for 
example, in Harlow, E. and D. Lane (1988) ANTIBODIES: A Laboratory Manual, Cold Spring 
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Harbor Laboratory, Cold Spring Harbor, NY. which is incorporated herein by reference. Briefly, 
for example, ARTSl or an immimogenic fragment thereof is injected into mice. The spleen of 
the mouse is removed, the spleen cells are isolated and fused with immortalized mouse cells. 
The hybrid cells, or hybridomas, are cultured and those cells that secrete antibodies are selected. 
The antibodies are analyzed and, if found to specifically bind to ARTSl, the hybridoma which 
produces them is cultured to produce a continuous supply of antibodies, 

[0028] According to some embodiments, the present invention relates to an isolated 
nucleic acid molecule comprising a nucleotide sequence that encodes the amino acid sequence of 
SEQ ID NO:2. Such molecules can be routinely designed using the information set forth in SEQ 
ID N0:2. In certain embodiments, the invention relates to an isolated nucleic acid molecule 
comprising SEQ ID NO:l. Nucleic acid molecules that are fragments of nucleic acid molecules 
comprising a nucleotide sequence that encode the amino acid sequence of SEQ ID NO:2 and of 
nucleic acid molecules comprising SEQ ID N0:1 are also encompassed by the present invention. 
By "fragment" is intended a portion of the nucleotide sequence encoding the ARTSl protein or 
an ARTSl -like polypeptide, A fragment of an ARTSl nucleotide sequence may encode a 
biologically active portion of an ARTSl-Uke protein, or it may be a fragment that can be used as 
a hybridization probe or PGR primer. A biologically active portion of an ARTS 1 -like protein can 
be prepared by isolating a portion of one of the nucleotide sequences of the invention, expressing 
the encoded portion of the ARTSl-hke protein (e.g., by recombinant expression in vitro), and 
assessing the activity of the encoded portion of the ARTSl-like proteinu Nucleic acid molecules 
that are fragments of an ARTSl -like nucleotide sequence comprise at least about 10, 15, 20, 50, 
75, 100, 200, 300, 350, 400, 450, 500, 550, 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1050, 
.1100, 1150, 1200, 1250, 1300, 1350, 1400, 1450, 1500, 1550, 1600, 1650, 1700, 1750, 1800, 
1850, 1900, 1950 nucleotides, or up to the number of nucleotides present in a ftiU-length 
ARTSl-like nucleotide sequence disclosed herein (for example, up to 3791nucleotides for SEQ 
ID NO:l), depending upon the intended use. Nucleic acid molecules that are variants of the 
ARTSl nucleotide sequences disclosed herein are also encompassed by the present invention. 
"Variants" of the ARTSl nucleotide sequences include those sequences that encode the ARTSl 
protein or ARTSl-like polypeptides disclosed herein but that differ conservatively because of the 
degeneracy of the genetic code. These naturally occurring allelic variants can be identified with 
the use of well-known molecular biology techniques, such as polymerase chain reaction (PCR) 
and hybridization techniques as outlined below. Variant nucleotide sequences also include 
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synthetically derived nucleotide sequences that have been generated, for example, by using site- 
directed mutagenesis but which still encode the ARTSl-like proteins. Generally, nucleotide 
sequence variants of the invention will have at least about 45%, 55%, 65%, 70%, 75%, 80%, 
85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity to SEQ ID NO:L 

[0029] Using standard techniques and readily available starting materials, a nucleic acid 
molecule that encodes ARTSl may be isolated from a cDNA library, using probes or primers 
which are designed using the nucleotide sequence information disclosed in SEQ ID NO; 1 . In 
some embodiments, the nucleic acid molecules comprise the nucleotide sequence that consists of 
the coding sequence in SEQ ID NOrl. In some embodiments, the nucleic acid molecules consist 
of the nucleotide sequence set forth in SEQ ID NO: 1 . The isolated nucleic acid molecules of the 
invention are useful to prepare constructs and recombinant expression systems for preparing 
ARTSl. 

[0030] A cDNA library may be generated by well-known techniques. A cDNA clone 
which contains one of the nucleotide sequences set out is identified using probes that comprise at 
least a portion of the nucleotide sequence disclosed in SEQ ID NO: 1. The probes have at least 
16 nucleotides, preferably 24 nucleotides. The probes are used to screen the cDNA library using 
standard hybridization techniques. Alternatively, genomic clones may be isolated using genomic 
DNA from any human cell as a startmg material. In certain embodiments, the present invention 
relates to isolated nucleic acid molecules that comprise a nucleotide sequence identical or 
complementary to a fragment of SEQ ID NO: 1 which is at least 10 nucleotides. In some 
embodiments, the isolated nucleic acid molecules consist of a nucleotide sequence identical or 
complementary to a fragment of SEQ ID NO:l which is at least 10 nucleotides. In some 
embodiments, the isolated nucleic acid molecules comprise or consist of a nucleotide sequence 
identical or complementary to a fragment of SEQ ID NO:l which is 15-150 nucleotides. In 
some embodiments, the isolated nucleic acid molecules comprise or consist of a nucleotide 
sequence identical or complementary to a fragment of SEQ ID NO: 1 which is 15-30 nucleotides. 
Isolated nucleic acid molecules that comprise or consist of a nucleotide sequence identical or 
complementary to a fragment of SEQ ID NO: 1 which is at least 10 nucleotides are useftil as 
probes for identifying genes and cDNA sequence having SEQ ID NO: 1, PGR primers for 
amplifying genes and cDNA having SEQ ID NO:l, and antisense molecules for inhibiting 
transcription and translation of genes and cDNA, respectively, which encode ARTSl having the 
amino acid sequence of SEQ ID NO:2. 
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[0031] The cDNA that encodes ARTSl may be used as a molecular marker in 
electrophoresis assays in which cDNA from a sample is separated on an electrophoresis gel and 
ARTSl probes are used to identify bands which hybridize to such probes. Specifically, SEQ ID 
NO:l or portions thereof^ may be used as a molecular marker in electrophoresis assays in which 
cDNA from a sample is separated on an electrophoresis gel and ARTSl specific probes are used 
to identify bands which hybridize to them, indicating that the band has a nucleotide sequence 
complementary to the sequence of the probes. The isolated nucleic acid molecule provided as a 
size marker will show up as a positive band that is known to hybridize to the probes and thus can 
be used as a reference point to the size of cDNA that encodes ARTSl. Electrophoresis gels 
useful in such an assay include standard polyacrylamide gels as described in Sambrook et al.. 
Molecular Cloning a Laboratory Manual, Second Ed. Cold Spring Harbor Press (1989) which is 
incorporated herein by reference. 

[0032] The nucleotide sequences in SEQ ID N0:1 may be used to design probes, 
primers and complementary molecules which specifically hybridize to the unique nucleotide 
sequences of ARTSl. Probes, primers and complementary molecules which specifically 
hybridize to nucleotide sequence that encodes ARTSl may be designed routinely by those 
having ordinary skill in the art. As used herein, the term "q)ecifically hybridize to nucleotide 
sequence that encodes ARTSl" is meant to refer to nucleic acid molecules with unique 
nucleotide sequences that hybridize to ARTSl encoding sequences but not other known protein 
encoding sequences, such as sequences identical to portions of SEQ ID NO:l. This, the unique 
sequences described hereia are those that do not overlap with known sequences. 

[0033] The present invention also includes labeled oligonucleotides that are useful as 
probes for perfonning oligonucleotide hybridization methods to identify ARTSl . The 
oligonucleotides include sequences that specifically hybridize to nucleotide sequences that 
encode ARTSL Accordingly, the present invention includes probes that can be labeled and 
hybridized to unique nucleotide sequences that encode ARTSl. The labeled probes of the 
present invention are labeled with radiolabelled nucleotides or are otherwise detectable by 
readily available nonradioactive detection systems. In some preferred embodiments, probes 
comprise oligonucleotides consisting from 10 to 100 nucleotides. In some preferred 
embodiments, probes comprise oligonucleotides consisting of from 10 to 50 nucleotides. In 
some preferred embodiments, probes comprise oligonucleotides consisting of from 12 to 20 
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nucleotides. The probes preferably contain nucleotide sequence completely identical or 
complementary to a jfragment of a unique nucleotide sequences of ARTS 1. 

[0034] PGR technology is practiced routinely by those having ordinary skill in the art 
and its uses in diagnostics are well known and accepted. Methods for practicing PGR technology 
are disclosed in "PGR Protocols: A Guide to Methods and Applications", Innis, M.A., et al. Eds. 
Academic Press, Inc. San Diego, GA (1990) which is incorporated herein by reference. 
Apphcations of PGR technology are disclosed in *Tolymerase Chain Reaction" Erhch, H.A., et 
al., Eds. Gold Spring Harbor Press, Gold Spring Harbor, NY (1989) which is incorporated herein 
by reference. Some simple rules aid in the design of efficient primers. Typical primers are 18- 
28 nucleotides in length having 50% to 60% g+c composition. The entire primer is preferably 
complementary to the sequence it must hybridize to. Preferably, primers generate PGR products 
having from 100 base pairs to 2000 base pairs. However, it is possible to generate products of 50 
base pairs to up to 10 kb and more. 

[0035] PGR technology allows for the rapid generation of multiple copies of nucleotide 
sequences by providing 5' and 3* primers that hybridize to sequences present in a nucleic acid 
molecule, and further providing free nucleotides and an enzyme which fills in the 
complementary bases to the nucleotide sequence between the primers with the free nucleotides to 
produce a complemmtary strand of DNA. The enzyme will fill in the complementary sequences 
adjacent to the primers. If both the 5' primer and 3' primer hybridize to nucleotide sequences on 
the complementary strands of the same fragment of nucleic acid, exponential amplification of a 
specific double-stranded product results. If only a single primer hybridizes to the nucleic acid 
molecule, linear amplification produces single-stranded products of variable length. PGR 
primers include at least one primer which includes a nucleotide sequence that specifically 
hybridizes to nucleotide sequence that encodes ARTSL 

[0036] One having ordinary skill in the art can isolate the nucleic acid molecule that 
encode ARTSl and insert it into an expression vector using standard techniques and readily 
available starting materials. 

[0037] The present invention relates to a recombinant e>q)ression vector that comprises 
a nucleotide sequence that encodes ARTSl that comprises the amino acid sequence of SEQ ID 
NO:2, As used herein, the term '^recombinant expression vector" is meant to refer to a plasmid, 
phage, viral particle or other vector which, when introduced into an appropriate host, contains 
the necessary genetic elements to direct expression of the coding sequence that encodes the 
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ARTS 1 of the invention. The coding sequence is operably linked to the necessary regulatory 
sequences. Expression vectors are well known and readily available. Examples of expression 
vectors include plasmids, phages, viral vectors and other nucleic acid molecules or nucleic acid 
molecule containing vehicles useful to transform host cells and facilitate expression of coding 
sequences. In some embodiments, the recombinant expression vector comprises the nucleotide 
sequence set forth in SEQ ID NO:l. The recombinant expression vectors of the invention are 
usefid for transforming hosts to prepare recombinant expression systems for preparing ARTSl . 

[0038] The present invention relates to a host cell that comprises the recombinant 
expression vector that includes a nucleotide sequence that encodes ARTSl that comprises SEQ 
ID NO:l . In some embodiments, the host cell comprises a recombinant expression vector that 
comprises SEQ ID NO: 1 . Host cells for use in well known recombinant expression systems for 
production of proteins are well known and readily available. Examples of host cells include 
bacteria cells such as E. coli, yeast cells such as S. cerevisiae, insect cells such as S. frugiperda, 
non-himian mammalian tissue culture cells Chinese hamster ovary (CHO) cells and himian tissue 
culture cells such as HeLa cells. 

[0039] The present invention relates to a transgenic non-human mammal that comprises 
the recombinant expression vector that comprises a nucleic acid sequence that encodes ARTSl 
that comprises the amino acid sequence of SEQ ID NO:2. Transgenic non-human mammals 
useful to produce recombinant proteins are well known as are the expression vectors necessary 
and the techniques for generating transgenic animals. Generally, the transgenic animal 
comprises a recombinant expression vector in which the nucleotide sequence that encodes 
ARTSl is operably linked to a mammary cell specific promoter whereby the coding sequence is 
only expressed in. mammary cells and the recombinant protein so expressed is recovered firom the 
animal's milk. In some embodiments, the coding sequence that encodes ARTSl is SEQ ID 
NO:L 

[0040] In some embodiments, for example, one having ordinary skill in the art can, 
using well known techniques, insert such DNA molecules iato a conunercially available 
e?q)ression vector for use in well known expression systems. For example, the commercially 
available plasmid pSE420 (Invitrogen, San Diego, CA) may be used for production of coUagen 
in B. coli. The commercially available plasmid pYES2 (Invitrogen, San Diego, CA) may, for 
example, be used for production ia S. cerevisiae strains of yeast. The conmierciaUy available 
MAXBAC™ complete baculovirus expression system (Invitrogen, San Diego, CA) may, for 
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example, be used for production in insect cells. The commercially available plasmid pcDNA I 
(Invitrogen, San Diego, CA) may, for example, be used for production in mammalian cells such 
, as Chinese Hamster Ovary cells. One having ordinary skill in the art can use these commercial 
expression vectors and systems or others to produce Caspase-1 using routine techniques and 
readily available starting materials. (See e.g., Sambrook et al.. Molecular Cloning a Laboratory 
Manual, Second Ed. Cold Spring Harbor Press (1989) which is incorporated herein by 
reference.) Thus, the desired proteins can be prepared in both prokaryotic and eukaryotic 
systems, resulting in a spectrum of processed forms of the protein. 

[0041] One having ordinary skill in the art may use other commercially available 
expression vectore and systems or produce vectors using well known methods and readily 
available starting materials. Expression systems containing the requisite control sequences, such 
as promoters and polyadenylation signals, and preferably enhancers, are readily available and 
known in the art for a variety of hosts. See e.g., Sambrook et al,. Molecular Cloning a 
Laboratory Manual, Second Ed. Cold Spring Harbor Press (1989). 

[0042] A wide variety of eukaryotic hosts are also now available for production of 
recombinant foreign proteins. As in bacteria, eukaryotic hosts may be transformed with 
e^qpression systems which produce the desired protein directly, but more commonly signal 
sequences are provided to effect the secretion of the protein. Eukaryotic systems have the 
additional advantage that they are able to process introns which may occur in the genomic 
sequences encoding proteins of higher organisms. Eukaryotic systems also provide a variety of 
processing mechanisms which result in, for example, glycosylation, carboxy-terminal amidation, 
oxidation or dCTivatization of certain amino acid residues, conformational control, and so forth. 

[0043] Commonly used eukaryotic systems include, but are not limited to, yeast, fungal 
cells, insect cells, mammalian cells, avian cells, and cells of higher plants. Suitable promoters 
are available which are compatible and operable for use in each of these host types as well as are 
termination sequences and enhancers, e.g. the baculovirus polyhedron promoter. As above, 
promoters can be either constitutive or inducible. For example, in mammalian systems, the 
mouse metallothionein promoter can be induced by the addition of heavy metal ions. 

[0044] The particulars for the construction of expression systems suitable for desired 
hosts are known to those in the art/ Briefly, for recombinant production of the protein, the DNA 
encoding the polypeptide is suitably ligated into the expression vector of choice. The DNA is 
operably linked to all regulatory elements which are necessary for expression of the DNA in the 
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selected host. One having ordinary skill in the art can, using well known techniques, prepare 
expression vectors for recombinant production of the polypeptide. 

[0045] The expression vector including the DNA that encodes the Caspase-1 is used to 
transform the compatible host that is then cultured and maintained under conditions wherein 
expression of the foreign DNA takes place. The protein of the present invention thus produced is 
recovered firom the culture, eitha: by lysing the cells or j&om the culture medium as appropriate 
and known to those in the art. One having ordinary skill in the art can, using well known 
techniques, isolate ARTSl that is produced using such expression systems. The methods of 
purifying ARTSl from natural sources using antibodies which specifically bind to ARTSl as 
described above, may be equally applied to purifying ARTSl produced by recombinant DNA 
methodology. 

[0046] Examples of genetic constructs include ARTSl coding sequence operably 
linked to a promoter that is functional in the cell line into which the constmcts are transfected. 
Examples of constitutive promoters include promoters from cytomegalovirus or SV40. 
Examples of inducible promoters include mouse mammary leukemia virus or metallothionein 
promoters. Those having ordinary skill in the art can readily produce genetic constructs useful 
for transfecting with cells with DNA that encodes ARTSl from readily available starting 
materials. Siich gene constructs are useful for the production of ARTS 1 . 

[0047] In some embodiments of the invention, transgenic non-human animals are 
generated. The traaasgenic animals according to the invention contain SEQ JD NO:l under the 
regulatory control of a mammary specific promoter. One having ordinary skill in the art xising 
standard techniques, such as those taught ia U.S. Patent No. 4,873,191 issued October 10, 1989 
to Wagner and U.S. Patent No. 4,736,866 issued April 12, 1988 to Leder, both of which are 
iacorporated herein by reference, can produce transgenic animals which produce ARTSl . 
Preferred animals are rodents, particularly rats and mice, and goats. 

[0048] In addition to producing these proteins by recombinant techniques, automated 
peptide synthesizers may also be employed to produce ARTSl . Such techniques are well known 
to those having ordinary skill in the art and are usefiil if derivatives fliat have substitutions not 
provided for in DNA-encoded protein production. 

[0049] One aspect of the invention relates to gene therapy, specifically "gene 
rqjlacement." Gene replacement* ' refers to the replacement of a mutated genetic element with a 
normal gene. The present invention provides methods of gene therapy that is a "gene 
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replacement" therapy. Generally the present gene replacement method involves inhibition of an 
abnormal ARTS-1 product coupled with replacement with the normal ARTS-1 gene. Generally, 
melhods of the present invention can be used to treat conditions associated with tumorogenesis 
related to a lack oforiiisufBcient amount of fimctional wild type ARTS-L Methods ofthe 
present invention may be used to replace the abnormal ARTS-1 gene with a normal ARTS-1 
gene. 

[0050] By normal ARTS-1 gene is meant any gene which, when encoded produces a 
biologically active, wild-type tumor suppressing ARTS-1 protein. By abnormal or mutant gene 
is meant any gene which, when encoded, does not produce a biologically active, wild-type 
ARTS-1 protein and /or is insufficiently present to perform a tumor suppression function. 

[0051] The term "DNA construct" as used herein refers to any DNA molecule which 
has been modiJEed such that the nucleotide sequences in the molecule are not identical to a 
sequence which is produced naturally. 

[0052] The term "expression vector", as used herein, is defined as a DNA construct 
which includes an autonomous site of replication, a site of transcription initiation, and at least 
one stractural gene coding for a protein which is to be expressed in a host organism. The 
expression vector will usually also contain appropriate control regions such as a promoter and 
tOTninator which control the expression of the protein in the host organism* Expression vectors 
ofthe present invention may include retroviral vectors such as the "double copy" vector. As one 
skilled in the art woiild recognize, the particular vector chosen depends partly upon the cell-type 
targeted. 

[0053] In preferred embodiments of the present invention the expression vector 
includes a promoter. Vectors encoding one or more riboz3anes should preferably utiUze a strong, 
RNTA polymerase HI type promoter. Useful promoters include, but are not limited to tRNA and 
SV40 promoters. Expression vectors of the present invention may also include homologous 
sequences with a host gene to provide for integration of the modified gene into the chromosome 
of the host. 

[0054] The term '^ifimctional expression vector'' as used herein is defined as an 
expression vector which contains at least one structural gene cassette coding for a protein which 
is to be expressed in a host organism and a regulatory cassette coding for a regulatory element. 
The regulatory cassette may code for any element which functions within the cell to inhibit the 
expression of one or more genes. In accordance with prefOTed embodiments ofthe present 
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invention the regulatory cassette codes for aa RNA fragment having ribozjnme activity eflTective 
to cleave a separate RNA molecule. 

[0055] Cassette, as used herein, refers to a discrete DNA fragment tiiat encodes a 
control region and a DNA sequence of interest such a structural protein. 

[0056] The term "plasmid" is used herein in accordance with its commonly accepted 
meaning, i.e. autonomously replicating, usually close looped, DNA. 

[0057] *llibozyme" as the term is used herein, refers to an en2yme which is made of 
RNA. Ribozymes are involved in the cleavage and/or hgation of RNA chains. In preferred 
embodiments of the present invention, "hammerhead ribozymes" are used. As described above, 
hammerhead ribozymes cleave the phosphodiester bond of a target RNA downstream of a GUX 
triplet where X can be C, U, or A. Hammerhead ribozymes used in methods of the present 
invention have a structural domain having the sequence 3'- 

CAAAGCAGGAGCGCCUGAGUAGUC-5' (SEQ ID NO:3) . Site specific regulatory elements 
such as site specific ribozymes are provided in accordance with the present invention. The 
ribozyme regulatory element is made site specific, having the sequence 3'-Xn- 
CAAAGCAGGAGCGCCUGAGUAGUC-Ym-5' ((SEQ ID NO:4), reported in 5' to 3' 
direction) where X and Y are complementary to regions of the target mRNA flanking the GUC 
site and n-Hm are generally from about 20 to about 35 RNA bases in length. n-Hn need not be of 
equal lengths although it is preferable that neither n nor m is less than about 10. 

[0058] Hammerhead ribozymes target the triplet GUC, For a gene of interest a target 
site can be identified by analyzing the gene sequence to identify GUC triplets. Computer 
analysis of secondary structure may assist in site selection. Denman, (1993), Biotechniques, 15, 
1090-1094. 

[0059] Vectors of the present invention may be delivered to a patient via methods 
known in the art. Retroviral mediated deUvery is particularly preferred in some embodiments of 
the invention. In vivo delivery by of retroviral vectors may be achieved, for example by i. v. 
injection of the retroviral vectors. A double balloon catheter may also be used for direct delivery 
of retroviral vectors to the patient. 

[0060] According to one aspect of the invention, compounds may be screened to 
identify compounds that inhibit or euhance Caspase-1 activity. Substrates of Caspase-1 include 
baculovims protein p35 and the Sf immunophillin FKBP46. Assays may be performed 
combining Caspase-1 with a substrate in the presence or absence of a test compound The level 
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of Caspase-1 activity in the presence of the test compoimd is compared to the level in the 
abs^ce of the test compoimd. If Caspase-1 activity is increased by the presence of the test 
compound, the test compoimd is an enhancer. If Caspase-1 activity is decreased by the presence 
of the test compound, the test compound is an inhibitor. In some embodiments of the invention, . 
the preferred concentration of test compound is from IpM to SOO^iM. A preferred concentration 
is from lOfiM to lOOfiM. In some preferred embodiments, it is desirable to use a series of 
dilutions of test compounds. 

[0061] Kits are included which comprise containers with reagents necessary to screen 
test compounds. Such kits include a container with Caspase-1 protein, a container with a 
substrate such as FKBP46 or p35, which is preferably a labeled substrate, and instructions for 
performing the assay. Kits may include a control inhibitor such as anti-Caspase-1 neutralizing 
antibodies. 

[0062] Combinatorial Ubraries may be screened to identify compounds that enhance or 
inhibit Caspase-1 activity. 

EXAMPLES 

[0063] EXOFISH (Roest Crollius, et al. Estimate of human gene number provided by 
genome-wide analysis using Tetaodon nigroviridis DNA s^equence. Nat Genet 25, 235-8. (2000), 
which is incorporated herein by reference) was used to scan 1.4 Mb of assembled genomic 
sequence at chromosome 13ql4 ^abuchi, H. et al. Cloning and characterization of CLLD6, 
CLLD7, and CLLD8, novel candidate genes for leukemogenesis at chromosome 13ql4, a region 
commonly deleted B-cell chronic lymphocytic leukemia. Cancer Res 61, 2870-7. (2001), 
Bullrich, F. et al. Characterization of the 13ql4 tumor suppressor locus in CLL: identification of 
ALTl, an alternative splice variant of the LEU2 gene. Cancer Res61. 6640-8, (2001), Lander, 
E.S. et al. Initial sequencing and analysis of the human genome. Nature 409, 860-921. (2001), 
and Venter, J.C. et al. The sequence of the human genome. Science 291, 1304-51. (2001) which 
are each incorporated herein by reference) for putative genes. A 182 bp *ecore' (evoluntionary 
conserved region) coding for an aminoacidic sequence with high homology to several members 
of the ADP-ribosylation factor family was found. By using EST walking and RACE, the 
corresponding fuU-lengfli cDNA was obtained. Comparison with the genomic sequence 
indicated that the cloned cDNA, which was designated ARTSl (for ADP-Ribosylation factor- 
Like, putative Tumor Si5)pressor gene 1), derives from a small gene composed of two exons 
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separated by a 1.8 kb intronic sequence and spanning about 6 kb of DNA. Using LOH analysis it 
was found that this region was heterozygously deleted in a fraction of tumors between 10% 
(colon cancers) and 20% (B-CLL). The putative ORF, within the second exon, encodes a 196- 
anuno acid protein with a predicted molecular mass of 21 kDa. BLAST analysis and Conserved 
Domain search of protein databases revealed highly significant homology with the ADP- 
ribosylation factor (AKF) and ARF-like (ARL) protein subfamily of the ras family (Moss, J, & 
Vaughan, M. Molecules in the ARF orbit J Biol Chem 273, 21431-4. (1998), and Kahn, R.A., 
Der, CJ. & Bokoch, G.M. The ras superfamily of GTP-biriding proteins: guidelines of 
nomenclature. Faseb J 6, 2512-3 (1992), which are each incorporated herein by reference). At 
the protein level, related proteins share at most 45% identical amino acids. A multiple aUgnment 
with the CLUSTALW program indicates that ARTSl belongs to the subgroup formed by AJRL4 , 
ARL6 and ARL7 (Jacobs, S. et 1. ADP-ribosylation factor (ARF)-like 4, 6, and 7 represent a 
subgroup of the ARF family characterized by rapid nucleotide exchange and nuclear localization 
singnal, FEBS Lett 456, 384-8.(1999)) (Figure 1). 

[0064] Northern analysis of normal human tissues witii an ARTS 1 probe revealed 
ubiquitous expression of a 2.2 kb transcript. In some tissues, two additional minor bands of 
approximately 1,3 and 5.5 kb were detected resulting from the use of different polyadenylation 
sites (Figure 4). The expression of ARTSl was analyzed by Northem blot and/or 
semiquantitative RT-PCR in a set of 59 hematopoietic and solid tumor cell lines. ARTSl 
expression was significantly reduced or absent in 22% (7/32) of blood cancer cell lines, 78% 
(7/9) of lung cancer cell lines, 33% (2/6) of esophageal cancer cell lines and 22% of pancreatic 
cancer cell lines as well as in HeLa S3 (cervical carcinoma), SW 480 (colorectal cancer) and G- 
361 (melanoma) cell hnes. In addition, 4 out of 16 fresh tumor samples (25%, 2/7 lung 
carcinomas and 2/9 B-CLL) for which cDNA and/or RNA were available showed reduction or 
absence of ARTSl expression when compared to their normal tissue coimterparts (Figure 2 and 
Table 5). 

[0065] The possibiUty that, as occiurs with other cancer-related genes such as TSCLl 
(Kuramochi, M. et al. TSLCl is a tumor-suppressor gene in human non-smaU-cell lung cancer. 
Nat Genet 27, 427-30. (2001) which is incorporated herein by reference) or pl6 (Merlo, A. et al. 
5' CpG island methylation is associated with transcriptional silracing of the tumour suppressor 
pl6/CDKN2/MTSl in human cancers. Nat Med 1, 686-92. (1995) which is incorporated herein 
by reference), ARTSl is downregulated through hypermethylation of the putative promoter was 
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examined First, the global methylation level around ARTSl was analyzed by Southern blotting 
using cell lines for which expression data was available. The level of expression is correlated 
with the methylation status of the genomic region - cell lines with low or no ARTSl expression 
are highly methylated, while cell lines with normal levels of expression display only one 
methylated site (Figure 2). ARTS 1 DNA methylation patterns were examined in more detail 
through bisulfite sequencing to determine the methylation status of 5 CpG sites near the putative 
promoter sequences. Fresh tumor samples and tumor cell lines with low or absent ARTSl 
expression showed higher methylation levels than normal tissues or tumors with normal 
expression levels (Figure 2 and Table 5), 

[0066] During an initial mutation screening with 80 cell lines (including 70 used for 
gene expression and 10 melanoma cell lines), three mutations have been identified. The first, a 
missense mutation G446A (Trpl49Stop) is present in homozygosity in the MCF7 breast cell line 
and in heterozygosity in the HS776T pancreatic carcinoma cell line. Two heterozygous 
substitutions were identified in melanoma cell lines: a T50C (Metl7Thr) and a C262A 
(Leu88Met) in one of the patients with T50C (Table 1), 

10067] In order to establish the significance of these mutations three panels of samples 
were screened (Methods and Table 6). The first includes 216 hunaan tumors that were screened 
by direct sequencing of the ARTSl ORF. Eight cases carried the G446A (Tipl49Stop) 
mutation, including 3 breast cancers (3/48 of cases, 6^5%), 2 colorectal carcinomas (2/58, 
3.45%), 1 lung carcinoma (1/5, 20%), 1 thyroid tumor (1/65, 1.5%) and one idiopathyc 
pancytopenia. All tumor samples had both the wild-type and mutant alleles except for a breast 
tumor with LOH at the ARTSl locus, which was homozygous for the mutation. Sequencing of 
the ARTS 1 in paired normal tissues, which were available for three out of six tumors, revealed 
the same alteration in the gennline of patients. 

[0068] The second panel contains 109 blood DNAs firom patients with multiple cancers 
or with a family history of cancer screened by direct sequenciug. Six additional cases with the 
G446A (Trpl49Stop) were identified - 2 malignant melanomas -H prostate carcinoma cases (2/17, 
1 1,75%), 2 cases of familial CLL (2/17, 1 1.75%), 1 case of pancreatic + melanoma (1/6, 16.5%) 
and 1 breast cancer (1/69 of cases, 1.5%) (see Table 2 for family history). At the protein level, 
the stop codon inserts a premature termination 48 amino acids before tiae C-terminus, leading to 
the synthesis of a smaller protein with 148 instead of 196 aixiino acids (Fig. 1). Thus, the 
truncated protein lacks the C-terminus motif presumably involved in nucleotide binding and 
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hydrolysis characteristic of Ras related GTPases, one of the five additional amino acids typical 
of the ARF subfamily (Glyl61) and the putative nuclear localization signal. Furthermore, 
Trpl49, the site of the mutation is conserved in ARL4 and in 1 1 otiier ARF or ARF related genes 
including all six ARF genes, 

[0069] The third panel comprises the case-controls: allele frequency for the G446A 
(Trpl49Stop) mutation in three separate Caucasian cohorts was 2.10%, with variations between 
0.86% (1/1 16) in the US population and 3.44% (7/203) in the Italian population. Overall, 14 
patients out of 325 analyzed (4.63%) and 10 out of 475 normal controls (2.1%) had the stop 
mutation. The odds of G446A (Trpl49Stop) were 2.10 (95% CI 0.92 - 4.77) times higher in 
cancer patients versus controls. After stratification upon family history of cancer, this odds 
increase in the group with positive family history to 2.70 (95% CI 0.85 - 8.32) (Table 6). In 
addition to the G446A (Trpl49Stop), several other variants in the ARTSl gene were identified 
including a G490A (Glul64Lys) substitution in a thyroid adenoma (Table 1). Four mutations in 
a total of 64 thyroid adenomas and carcinomas analyzed were found (two C65T missenses, one 
G446A nonsense and one G490A missense). All four mutations were found in adenomas of 
follicular origin, whereas all samples of non-follicular hystotype (42/65, 65%) were wild-type. It 
is highly unlikely that this allelic distribution is random (P=0-01 at Fisher exact test). Also, a 
G446A homozygous patient in a family with CLL has thyroid adenoma (Table 2). Taken 
together, fixese observations raise the possibility that this ARTSl is involved in a portion of 
thyroid tumors with follicular histotype. 

[0070] ARTS 1 appears to be the first ARF family member reported to be altered in 
human cancers. Because of their nuclear localization signal (NLS), ARL4, ARL6 and ARL7 
appear to be cargo molecules transported via the trauslocators importin-i and a in the nucleus 
where they have yet unknown fimctions. Of note, ARTSl lacks a classical NLS at its C- 
teiminus, and probably contains an atypical NLS. Using GEP constructs, the wild-type ARTSl 
protein was shown to be localized both in the nucleus and in the cytoplasm. The mutant ARTSl 
AC-terminus protein has the same intracellular protein (Figure 6). ARTSl may be involved in 
novel cytoplasmic/nuclear membrane trafGcking and/or signaling cascades that are important in 
different types of cells. 

[0071] Northem and RT-PCR expression data showed that ARTS 1 expression was 
dramatically decreased in A549, a highly tumorigenic non-small cell lung carcinoma (NSCLC) 
cell line(Fogh, J,, Fogh, J.M. & Orfeo,T. One hundred and twenty-sevea cultured human tumor 
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cell lines producing tumors in nude mice. J Natl Cancer Inst 59, 221-6 (1977) which is 
incorporated herein by reference) when compared to the level found in normal lung. The ARTSl 
ORF under the control of the LTR promoter was transfected into A549. Several stable clones 
were obtained and five of them were used in e:q>eriments: parental A549, the A549-pMV-7 
(empty vector) clone, and three neomycin-resistant transfectants, selected according to the level 
of expression of the transfected ARTS 1 minigene (Figure 3). To evaluate the biological effect of 
ARTS 1 in vitro and in vivo, tumorigemcity was examined by soft agar and in Nu/Nu nude mice 
OPigure 3), All three transfected clones give rise to smaller colonies with a shorter survival in 
comparison with the parental cell or cells transfected with the empty vector. Furthermore, during 
10 weeks of observation after the s.c, injection, the former consistently formed smaller, 
nonprogressive tumors, while the latter formed large, progressively growing tumors in nude 
mice. Thus, ARTSl by itself has significant tumor-suppressor activity in A549 cells. 

[0072] The presence of a new tumor suppressor within the well-characterized 
superfamily of Ras oncogenes is not as contradictory. It was recently shown that wild type 
Kras2 could inhibit limg carciaogenesis in mice, clearly illustrating the tumor suppressor role of 
the gene in lung tumorigenesis (Zhang, Z. et al. Wildtype Kras2 can inhibit limg carcinogenesis 
in mice. Nat genet 29. 25-33. (2001) which is incorporated herein by reference). The principal 
mechanism for ARTSl inactivation in human cancers is biaUelic methylation, as was proposed in 
the revised Knudson's two hit hypotheses (Jones, P.A. & Laird, P.W. Cancer epigenetices comes 
of age. Nat Genet 21, 163-7. (1999) which is incorporated herein by reference). One intriguing 
aspect of ARTS 1 involvement in human cancer is the real significance of the G446A 
(Trpl49Stop) nonsense mutation. Because the frequency of G446A mutation is about three 
times higher in familial cancers as in the general population and about two times higher as in 
sporadic cancers, one possible explanation is that ARTSl germline mutations have low 
penetrance and are associated with a smaU percentage of familial melanoma or famiUal CLL 
cancers (which harbor a ten times higher frequency of the truncating mutation as in the same 
population control group). According to this, it is possible that there exists kindreds which carry 
the mutation but did not develop cancer. The same is true also for some other TSGs as is the 
case of BRCA2 germline mutations in breast and pancreatic cancers (Goggins, et al. 
Germline BRCA2 gene mutations in patients with appar^tly sporadic pancreatic carcinomas. 
Cancer Res 56, 5360-4. (1 996) which is incoiporated herein by reference). An alternative 
explanation is that this truncating mutation does not have a pathogenetic role in human cancers, 
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because the lost domains are not important for tumorigenesis or because the protein has 
redundant functions with other ARL family members. Until now, only one polymorphic stop 
codon was identified in cancer related genes, the Lys3326ter in BRCA2 gene (Mazoyer, S. et al. 
A polymorphic stop condon in BRCA2, Nat Genet 14, 253-4. (1996) which is incorporated 
herein by reference). However, until independent groups analyze a larger number of cases, the 
possibility that such polymorphisms are associated with a modest increased cancer risk or are 
associated with other phenotypes in the heterozygous or homozygous state camot be excluded. 
Methods 

[0073] Cell Lines. Eighty cell lines derived from human tumors were used in this 
study. Forty-four were hematopietic cancer cell lines and 36 were soHd tumors cell lines (for 
detailed list, see Table 3). As controls, six lymphoblastoid cell lines made from peripheral blood 
lymphocytes of patients with Alzheimer's disease by transformation with Epstein Barr Vims 
(EBV) were used. All the cell lines were obtained from the American Type Culture Collection 
(ATCC) (Manassas, VA) and maintained according to ATCC instmctions. 

[0074] Patient Samples. Experimental samples wctc derived from sporadic tumors or 
from peripheral blood of patients with familial cancer (total of 325), Control samples were 
dOTved from blood of patients with diseases other than cancer or from healthy individuals (total 
of 475). All samples were obtained with informed consent following institutional guidelines for 
the protection of human subjects. The 216 human sporadic turgors analyzed include 65 thyroid 
tumors, 58 colorectal adenocarcinomas, 48 breast carcinomas, 39 B-CLLs, 5 lung carcinomas 
and 1 idiopathyc pancytopenia. The panels of DNA from blood include: a) 69 DNA samples 
from females with BRCA-1 - and BRCA-2-negative familial breast cancer; b) 17 DNA samples 
from males affected with prostate cancer and malignant melanoma which had been found 
negative for mutations at the pl6 locus; c) 17 DNAs from patients with familial CLL (at least 
two first-degree relatives affected) and d) 6 DNAs from individuals with pancreatic cancer or 
melanoma who have a family history of at least one case of melanoma or pancreatic cancer and 
negative for mutations in the pl6 and pl4 genes. Patients' profile was similar for both groups: 
about 60% of cancer patients were from European Caucasian origin and the remaining 40% were 
from US persons. In the control group the proportions of the two cohorts were 75% and 25%, 
respectively. No bias toward distkict population groups (such as Ashkenazim) was noted. Hig^ 
molecular weight (BMW) DNA was extracted by conventional protocols (Sambrook, J., Frisch, 
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E.F. & Maniatis, T. Molecular cloning: A Laboratory Manual, (Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, NY, 1989),which is incorporated herein by reference). 

[00751 Rapid Amplification of cDNA Ends (RACE). The 3' and 5' ends of noRNAs 
were obtained by RACE firom human testis, fetal liver, bone marrow and lymph node, using the 
Marathon-ready and the SMART RACE protocols (Clontech, Palo Alto, CA). The PCR 
products were separated on 1.0-2.0% agarose gels and gel purified using the QIAquick gel 
extraction kit (QIAGEN) or cloned in TA vector using TOPO TA Cloning (Invitrogen Carlsbad, 
CA) and sequenced. 

[0076] Northern blot analysis. Human multiple tissues Northern blots were purchased 
from Clontech and total RNA was extracted from tumor cell lines or tumors by the QIAGEN 
RNeasy mini kit (QIAGEN) according to the manufacturer's protocol. The membranes were 
hybridized with a 443-bp probe containing the majority of the ARTSl open reading frame (ORF) 
labeled with ^^P dCTP by random priming (Prime-it n Kit, Stratagene). Prehybridization and 
hybridization were carried out in Church Buffer (7% SDS, 0.5M phosphate buffer pH 7.2, 10 
mM EDTA) for 18-20 h at 65°C as described in Sambrook SUPRA. 

[0077] Reversed Transcription PCR (RT-PCR) analysis. The DNA sequence was 
confirmed by RT-PCR and a semiquantitative RT-PCR was performed to analyze the levels of 
gene expression in different normal and tumor tissues. Five microliters of cDNA were used for 
each PCR with Advantage2 PCR kit (Clontech) and 10 pmol of each gene-specific prinaer for 35 
cycles of 94''C for 20 s, SS^'C for 30 s, 68*=*C for 1 min (for a complete list of primers used in this 
study, see Table 4). To ensure that the RNA was of sufficient purity for RT-PCR, a PCR assay 
with primers specific for GAPDH cDNA (Clontech) was used. Semiquantitative PCR was 
performed with 23 cycles of ampUfication for ARTSl gene and 18 cycles for GAPDH, in order 
to remain within a range of linear increase in the amount of PCR product. RT-PCR products 
were separated by agarose gel electrophoresis and blotted on Hybond N + nylon membranes 
following standard procedures in Sambrook SUPRA. Membranes were hybridized with the 
same probe and in the same conditions as for Northern blotting. The relative intensity of 
hybridization signals was analyzed with a Phospholmager system (Molecular Dynanrics). 

[0078] Methylation analysis by Southern blotting. In order to identify the global level 
of methylation for the ARTSl locus, five micrograms of total genomic DNA were digested with 
Bgin alone or in combination with methylation-sensitive Hpall (Roche) using a total of 40 U of 
enzyme for 12 h. Digests were electrophoresed on 0,8% agarose gels and blotted on Hybond N+ 
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positively charged nylon membranes (Amersham Pharmacia Biotech) and hybridized with the 
same ORF probe as described before. 

[0079] Methylation-specific PGR. To analyze methylation levels in the 5' upstream 
region of ARTS 1, a region upstream of the first exon on ARTSl was amplified and bisulfite 
sequencing was carried out as described in Fronmier, M. et al. A genomic sequencing protocol 
that yields and positive display of S-methylcytosiae residues in individual DNA strands. Proc 
natl Acad Sci U S A 89, 1 827-3 L (1992), which is incorporated herein by reference. Modified 
DNA(200ng) was subjected to PGR. PGR products were purified and directly sequenced in 
order to obtain average methylation levels. In addition, PGR products were subcloned and at 
least six clones were sequenced to confirm direct sequencing data. Because of the xmavoidable 
contamination of normal cells in the tumor specimens, we defined a GpG site as 
"hypermethylated" when more than 70% of PGR products contained bisulfite-resistant cytosines 
"Partial methylation" indicates detection of these products in 20-70% of the total products. 

[0080] LOH studies. The paired normal and colorectal tumor DNA samples were 
tested for LOH by PGR amplification with oligonucleotide primers- for microsatellite markers at 
D13S165 and D13S273 using fluorescent-labeled primers (ABI). One single nucleotide 
polymorphism found inside the ORF of ARTSl (T442C) was heterozygous m about 45% of 
sequenced samples and was very usefiil for the rapid discrimination of 
informative/noninformative patients. The amplification products were run on an Applied 
Biosystems Model 377 DNA sequencing system (PE, AppUed Biosystems). The LOH data for 
39 paired normal/timaor B-GLL samples used in this study were previously reported in Bullrich, 
F. et al. Minimal region of loss at 3ql4 in B-cell chronic lymphocytic leukemia. Blood 88, 
3 1 09-1 5 (1 996), which is incorporated herein by reference. 

[0081] Mutation detection. Primers used in mutation analysis were designed fi-om 
intronic sequences directly upstream of the second exon and within the 3* UTR region of 
ARTSl. PCRs were carried out for 35 cycles of 94°C for 30 s, 62°C for 30 s and 72*^0 for 1 min 
using RedTaq genomic DNA polymerase (Sigma-Aldrich, St, Louis, MO), purified with the 
QIAquick PGR purification kit (QIAGEN) and then both strands were directly sequenced using 
the AppUed Biosystems Model 377 DNA sequencing-system (PE, Applied Biosystems, Foster 
City, CA). The 203 normal controls firom the Italian population w^e analyzed by denatured 
high-performance liquid chromatogr^hy (DHPLC) (Transgenomics, Omaha NE). The 
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temperature used for heteroduplex fonnation was 57^C and all the samples with abnormal 
patterns were directly sequenced. 

[0082] Subcellular localization. The pEGFP Nl - ARTS 1 vector was prepared by 
digesting pEGFP Nl (Clontech) with Smal; the insert was obtained by amplifying the ARTSl 
full-length insert with Pfu where its stop codon was eliminated in order to generate an ARTSl- 
EGFP protein fused at the C-terminus. An additional pEGFP Nl -ARTSl AC-terminus vector 
was prepared carrying the ARTS 1 protein truncated at position 446 of the ORP where the stop 
mutation is located. 293 cells were transfected by calcium phosphate (ProFection from Promega, 
Madison WI) and cultured on a cover slip and 24-48 h after transfection cells were analyzed by 
fluorescence microscopy as described in Ghosh, K. & Ghosh, H.P- Role of the membrane 
anchoring and cytoplasmic domains in intraceUular transport and localization of viral 
glycoprotiens. Biochem Cell Biol 77, 165-78 (1999), which is incorporated herein by reference. 

[0083] Stable transfection of A549 cells. A549 cell line was cultured in RPMI 
supplemented with 10% fetal bovine serum for the following studies. ARTSl expression vector 
P-MV7-ARLTS1 -sense was constructed by ligating the ARTSl open reading frame in sense 
orientation into a mammalian expression vector pMV-7. All constmcts were sequenced in order 
to exclude random mutants and were transfected by FuGENB6 transfection reagent according to 
the protocol (Boehringer Mannhiem). Transfected cells were selected with G418. 

[0084] Analysis of transformed phenotype. Soft-agar colony assay of A549 wild - type 
and ARTSl stable transfectants were performed as described in Trapasso, F. et al. Rat protein 
tyrosine phosphatase eta suppresses the neoplastic phenotype of retrovirally transformed thyroid 
cells through the stabilization of p27 (ECipl). Mol Cell Biol 20, 9236-46.(2000) which is 
incorporated herein by reference. A suspension of 106 cells in PBS (0.2ml) was injected 
subcutaneousy into the right flank of Nu/Nu athymic mice (Jackson Laboratories Charles River, 
Cambridge, MA). Mice were sacrificed after 1,3,5, and 8 weeks and tumors were removed, 
weighed and measured in three dimesions. All experiments were perfomed in accordance with 
institutional guidelines. 

[0085] Statistical analysis. Statistical analysis of results was performed using the 
Fisher's exact test; a P value of <0.05 was considered statistically significant. The cancer risk 
associated with the specific mutations identified in this study was anayzed using the odds ration 
(OR). 
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Table 1 - ARTSl sequence analysis in human cell lines, tumors and normal controls. 



Variant 
name* 


Amino 
acid 

Change 


Amino acid 
conservation 

(%)' 


Cell Lines 
(%) 


Sporadic 
tumors (%) 


Familial 
cancers, 
blood (%) 


Normals, 

blood 

(%) 


TSOtoC 


Met 17 
toThr 


9/14 (65) 


2/80 
(2.5)' 


0/216 


0/109 


0/272 


C65toT 


Set 22 
to Leu 


3/14 (21), PMl 
site 


0/80 


2/216(1)'* 


0/109 


1/272(0.4) 


C262 to 
A 


Leu 88 
to Met 


Leu only in 
ARTSl 


1/80 (1)^ 

^ 


0/216 


0/109 


0/272 


C392 to 
T 


Pro 
131 to 
Leu 


4/14 (29) 


6/80 
(7.5) 


14/216 
(6.5) 


4/109 (4) 


17/272 
(6.25) 


T442to 
C 


Cys 
148 to 
Arg 


Cys only in 
ARTSJ 


25/80 
(31) 


127/216 
(59) 


80/109 
(73) 


182/272 
(67) 


G446to 
A 


Trp 
149 to 
Stop 


12/14 (86) 


2/80 
(2.5) 


8/216 
(3.7) 


6/109 (6) 


10/475 (2.1) 


G490 to 
A 


Glu 
164 to 
Lys 


9/14 (65) 


0/80 


1/216 
(0.5)^ 


0/109 


0/272 



Note: 1 - We identified also several synonymous polymorphisms such as: CI 75 to T 
(Leu 59); G 297 to A (Ser 99); C345 to T (Val 1 15); G396 to C (Leu 132); G546 to A (Gin 182). 

2 - Data obtained by a multiple alignment of ARTSl protein with ARFl to ARF6 and 
ARLl to ARL7 at the GenomeNet CLUSTALW server. 

3 - Found only in melanoma cell lines. 

4 - Found only in thyroid adenomas. 

Table 2 - Clinical data from families with G446A (Trpl49Stop) mutation. 
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Proband, sex, age' 


Cancer type 


Cancer Family history 


KRR0003, female, 46 


B-CLL 


Twin sister G446 A +ve with B-CLL 


TOR-IB, male, 57 


B-CLL and lung 
cancer 


Sister, 53, homozygous G446A with Thyroid 
adenoma; his son, 30, obligate carrier. 
Essential thrombocytOTiia 
Brother, heterozygous G446 A - normal 
Mother, dead, obligate carrier, B-CLL at 
80 yrs-old 

Father, 86, obligate carrier, B-CLL 


P/M 35003, male, ? 


Gastric, 72 
Melanoma, 72 
Prostate, 73 


None 


P/M 35012, male,? 


Prostate, 66 
Melanoma, 67 


Mother, cancer, unknown location, ? 
Brother, prostate, 73 
Sister, "black moles", ? 
Daughter, breast, ? 


1054-22671, male, 
dead 


Melanoma, 50 
Lung metastasis, 55 


Patemal uncle, melanoma, ? 
Paternal aunt, pancreatic, ? 
Patemal cousin, pancreatic, ? 
Patemal cousin, head and neck, ? 


15-265-S87, female, ? 


Bilateral breast 
cancer, 32 and 35 
Ovarian cancer, 50 


Daughter, 48, G446A carrier, imaffected 



Note: 1 - ? = Age data unknown 



Table 3 - Cell lines used in the described experiments 
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Hystotype j ^ 


Cell lines used 


Burkitt's lymphoma 1 


AXj876, AS283, BL2, BL30, BL41, CA46, DAy /S, 
Daudi, EB-B, ED36, Jiyoye, Lauckes, Nanalwa, 
P3HR-1, Raji, Ilamos, RSI 1864, SKDHL and WMN 


Multiple myeloma 1 


WiiKr«?l MC/CAR. NC1-H929, RPM18226 and 
U266B1 


Large cell lymphoma 


DBandSR 


hnmunoblastic B cell lymphoma 


JMl 


Diffuse mix lymphoma 


HT 


Hodgkin's disease 


RPMl 6666 and Hs445 


Non-Hodgkin's disease 


1 • 


B-ALL 


TUVA- 11 RS4*11 697 


T-cell lymphomas and leukemias 


rVM Del 1 HH HSB2. HuT 102, MOLT-3, MOLT- 
4, and MJ 


Hairy cell leukemia 


1 Mo T 


CML-Erythroid leukemia 


K562 


Lung carcinomas 


A549, AFL, Calu-3, H69, H460, H1299, SKMES. 4914 
and 1285 


Pancreatic carcinomas 


AsPCl, BxPC3, Capan-2, CFPAC-1, HS766T, 
MiaPaca, PANCl, PSNl, and SU8686 


lEsophageal cancers 


TEl, TE2, ElO, TEl 5, KY200 and KY300 


MaUgnant melanoma 


M14, 1007 MP, IR 6, WM 266.4, 397 MEL, 13443 
and four cell lines derived from melanoma patiaats 
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Colon carcinoma 


LoVo 


Cervical carcinoma 


HeLa 



Table 4 - Primers used in the described experiments 



Primer name 


Primer sequence (5 '-3') 


Application 


3*-ex2F 


5' - CCA TGG GTT CTG TGA ATT 
CCA GAG G (SEQ ID N0:5) 


Northern blot analysis 


5'-ex2R2 


5 ' - CAG TGG TCC TGG AAT CTC 
TCT AGA C (SEQ ID NO:6) 


3'exlF 


5' - GCC AGC AGA AAG CAG CTC 
CAT AGG (SEQ ID NO:7) 


Reversed Transcription PCT 
(RT-PCR) analysis 


5'ex2Rl 


5' - TTC AGG AGG CTC CAC AGG 
CTC TGC (SEQ ID NO:8) 


MET-F 


5' - GAG GTA TGT ATT GAA AG 
AAG AGG (SEQ ID NO:9) 


Methylation-specific PGR 


MET-R 


5' - AAC AAA ACC CAA TAA CAA 
CTC CA (SEQ ID NO: 10) 


ORFrFl 


5' - CAG AAG ACA GTA GCT GAT 
GTG (SEQ ID NO: 11) 


Genomic Mutation detection 


ORF-R2 


5' - GAG CAA AGA TAT GCT GCT 
CTG (SEQ ID NO: 12) 


Mael-Fl 


5' - GCT GAG TCC AGA GAG ATT 
CCA GG (SEQ ID NO: 13) 


G446A (Tipl49Stop) 
detection by Mje/ digestion 
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Maei-Ki 


J - lUl C^UL/ L^lvj L^Avj AUA UAi 






GC(SEQIDNO:14) 





Table 5 - Expression levels and methylation status of the ARLTSI promoter in human 
cancer cell lines 



Name 


Origin 


ARTSl expression^ 


Methylation 


Normal lung 1 


Normal lung 


-f 


Low 


Normal lung 2 


Normal lung 




Low 


A 549 


Lung carcinoma 


□• 


Hypermethylation 


AFL 


Lung carcinoma 


□/+ 


Hypermethylation 


Calu-3 


Limg carcinoma 




ND 


H1299 


Lung carcinoma 




Hypermethylation 


H69 


Lung carcinoma 




ND 


1285 


Lung carcinoma 


a/+ 


ND 


H460 


Lung carcinoma 


:□/+ 


Hypermethylation 


Lymphoblastoid 1 


Immortalized lymphoblasts 


+ 


Low 


Lymphoblastoid 2 


Immortalized lymphoblasts 




Low 


Dell 


T cell lymphoma 


□/+ 


Hypermethylation 


HH 


T cell lymphoma 


.□ 


Hypermethylation 


HSB2 


TceU ALL 


□ 


Hypermethylation 


HuT 102 


T ceU lymphoma 




Hypermethylation 


K562 


CML-Erythroid leukemia 


□ 


ND 


MJ 


T cell lymphoma 


□ 


Hypermethylation 
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MoT 


T cell lymphoma 


□ 


Hypermetfaylation 


AS 283 


Burkitt's lymphoma 


+ 


Low 


BL41 


Burkitt's lymphoma 


+ 


Low 


PSN 1 


Pancreatic carcinoma 




Hypermethylation 


MiaPaca 


Pancreatic carcinoma 


□/+ 


Hypennethylation 


HeLa 


Cervical carcinoma 


□ 


Hypennethylation 


SW480 


Colon carcinoma 


D 


ND 


G-361 


Melanoma 


.□ 


ND 



a : +5 normal expression; +A, reduced expression and absent expression; ND - not done 



Table 6 - Allele frequency of G446A (Trpl49Stop) in unrelated cancer patients 
and control cases. 



Cancer patients 


Normal controls 


Tumor type 


Source 


Sample 

size, 
origin 


G446 

A 


Source 


Sample 

size, 

origin 


G446 
A 


Colorectal cancers 
"sporadic" 


Bucharest, 
Romaaoia 


58, 

tumor 


2 


Philadelp 
hia 


116, 
blood 


1 


Breast "sporadic" 


Ferrara, 
Italy 


38, 

tumor 


3 


Bucharest 


156, 
blood 


2 


Breast "sporadic" 


Aarhus, 
Deimiark 


10, 

tumor 


0 


Ferrara 


203, 
blood 


7 


CLL "sporadic" 


US 


39, 

timior 


0 








Lung **sporadic" 


Milan^ 


5, tumor 


1 
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Italy 












in5rroia sporadic 


Cataozaro, 
Italy 


65, 
tumor 


1 








CLL lamiual 


Paris, 
France 


11, 

blood 


1 








CLL familial 


US 


6, blood 


1 








Breat familial 


Philadelphi 
a, PA 


69, 
blood 


1 








Melanoma + 
prostate 


Philadelphi 
a,PA 


17, 

blood 


2 








Pancreatic + 
melanoma 


Philadelphi 

a, 

PA 


6, blood 


1 








Idiopathyc 
Pancytopenia 


Bucharest, 
Romania 


1, blood 


I 








Total 




325 


14 

(4.30 
%) 




475 


10 

(2.10 
%) 



-31- 



